Lecture 5 

Mathematical tools for image processing, 

continued 



Spatial versus transform operations 

Spatial operations are performed in spatial domain 

- Image is represented as f(x,y) 

- X=0,1,2,...,M 

- Y=0,1,2,...,N 

- Single pixel based operations, neighborhood operations, 
geometric transformation 

Transform operation 

- Image is transformed to other domain g(u,v) (Transformdomain) 

- Operations are performed in the other domain 

- Transformed operated image is inverse-transformed to the 
spatial domain 



Single pixel spatial operations 


Intensity mapping 
s=T(z) 

z: input image intensity 
s : Output image intensity 
T : the operation 

Example: getting the negative of 8 bit image 


j = T(z ) 
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T(z)= 2 8 -z 
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Neighborhood spatial operations 


The value of the pixel at x,y in the output 
image is calculated from a set of 
neighbor pixels centered at x,y in the 
input image 


Example, the calculated value = the 
average of the neighbors rectangle of 
dimensions M by N centered at the pixel 


g(x,y) = 


l 

MN 


(r,c)eS xy 


r, c are row and column index in the MxN neighbor set S 



Geometric spatial transformation 

Geometric spatial transformation consists of two basic operations:-|)spatiaI transformation of coordinates. 
Geometric transformation is the mapping of the 2 ) lntensit y interpolation 

. , u,varepixel 

coordinates of each pixel in an input image to another coordinates in 


(displaced/rotated,..) pixel in the output image. 

Intensitv interpolation is used to assign intensities for the 

relocated pixels in processed image 

Forward mapping: for each pixel in the input image, find 


(x,j) = r{(M,v)} 




the original image,while 
x,y are the corresponding 
pixel coordinates 
in the transformed image 


(shrinking) 


its location in the output image and assign its value Affine tranformation 


• Multiple output values are assigned to the same 
output pixel (Overlaps) 

• Some output locations may not assigned values (Holes) 
Inverse mapping: for each pixel in the output image, find 
the location in the input image by applying the inverse 
transform, use the interpolation to calculate intensity 
value based on the intensities in the input image location 

(u,v)=r 1 (x,y) 

Inverse mapping avoids holes and overlaps in the output image. 

So it is more efficient than forward mapping and preffered in 

geometric transformation applications(such as Matlab). 


[x, y,l\ = [v, w,l]r 
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T is the transormation matrix 


This transformation can scale, rotate, translate 
a set of pixels, depending on the chosen values 
of the elements of a matrix T. 


Common affine transformation matrices 


Affine transformations based on Eq. (2.6.-23). 


Transformation 

Name 


Identity 


Scaling 


Rotation 


Translation 


Shear (vertical) 


Shear (horizontal) 


Affine Matriv. T 


Coordinate 

Eqnations 


Example 
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x = c x v 
y = c y w 


x = v cos 0 — w sin 8 
y = V cos 9 + w sin 8 
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X = V + t x 
y = w + ty 
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X = V + s v w 
y = w 
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Effect of interpolation 
method on geometric 
transformation (rotation of 
21°) using inverse mapping 

(a) Original image 

(b) rotation / interpolation using 
nearest neighbor 
interpolation 

(c) rotation / interpolation using 
bilinear interpolation 

(d) rotation / interpolation using 
bicubic interpolation 
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Image registration 

(used in image comparison) 

Different images taken for the same scene 

At different times (satellite image for monitoring earth movement) 
With different viewing angles/ distances 


Required: Aligning the images so that measurements and comparison could be made 

ln image registeration, Input and output images are known and transformation matrix T is unknown. 


Reference image (to align to) 


Aligned image 



Input image (with some transformation) 
(ex. Different view/distance/orientation) 


The process 

aligns/registers the input 
image against the 
reference image 
Find the transformation 
T( reference image to 
input image) then apply 
T _1 on the input image 



Image registration 

A simple model for registration 
x = c l v-\- c 2 w + c 3 vw + c 4 
y = c 5 v + c 6 w + c 7 vw + c 8 
where x, y are refrence coordinates 
and v, w are the input image coordinates 

Apply these two equations at four tie points to obtain 8 equation 
in eight uriknowns, solve them for the unknowns=> you got the 
model 

Use the model to transform a II the input image to the reference 
image 

After transforming the coordinates, interpolation is used to assign 
intensity values 



(a) Reference image 

(b) input image 

(c) Registered image 

(d) Difference between a,c 
indicating some errors 
due to registration 


Vector and matrix operations 


Color image can be represented as three 
image components, each in a different 
spectrum, for the same scene 
In such multispectral image, each pixel is 
represented by a vector in n-dimensions 
space (3D in RGB case) 

Vector theory could be applied on such 
images, for example, the distance 
between a pixel vector and a point in the 
RGB "D space is given by the shown 
equation 

The RGB 3D space representation can be 
generalized for n dimensions space (ex 
multispectral images of NASA LANDSAT 



Euclidean distance between a pixel z apd a point a : 

D(z,a) = \z-af (z-a ) 2 
— — ^r) + (z G — a G ) +(z B ~a B ) 


= \\z-a 


Linear transformation and linear process 


• N dimensions space multispectral 
image is transformed into m 
dimensions space by applying a linear 
transformation w=A(z-a) where 

• z is the n by 1 pixel vector in the 
input image(vector for each pixel) 

• a is n by 1 vector 

• A is m by n transformation matrix 

• w is the resulting pixel vector in 
the m dimensions space 

• Linear transformation has many 
applications in image processing 


• A gray scale image is represented by a 
vector of MN by 1 by putting the image 
intensities in this vector row by row 

• A linear process on such image is 
represented as g=Hf+n, where 

• f is MN by 1 vector (one vector for the 
image) 

• H is MN by MN operation (process) 
matrix 

• ñ isan MN by 1 noise vector 

• Linear processes has many applications in 
image processing 






Report discussion 


Last lecture report: Problem 2.22 


New report: Problem 2.26 




Image transform 



Spatial 

domain 


Transform domain 


Spatial 

domain 


Linear two - dimensions general transform 

M - 1 N - 1 

T(u,v ) - II f(x, y)r(x,y,u,v ) 
f(x,y) = YTL T(u,v)s(x, y,u,v) 


x=0 y=0 

and its inverse transformis 

M-lN-l 


Transform Pair 


i = 0 v=0 


where r(), s() are the forward and inverse transformation kernels 


if 

r(x, y,u,v) = r x (x,u)r 2 (y ,v) 
Thekernel is separable 
in addition, thekernel is symetric 
if r p r 2 are functionally equals 


Note that each transformed pixel is calculated from all pixels in the input image and 
each inverse transformed pixel is calculated from all pixels in the transformed image 


Example for linear two dimensions transform; 

Fourier transform 


M-lN-l 

T(u,v) = ^^f(x,y)e 

x=0 y = 0 


_ ux vy. 
_j2k ( — +— ) 
M N 


M-lN-l 

f(x,y) = ^^T(u,v)e M N 

u = 0 v=0 



(a) Image corrupted 
by sinusoidal 
interference. (b) 
Magnitude of the 
Fourier transform 
showing the bursts 
of energy responsible 
for the interference. 
(c) Mask used to 
eliminate the energy 
bursts. (d) Result of 
computing the 
inverse of the 
modified Fourier 
transform. (Original 
image courtesy of 
NASA.) 



Transform matrix notation 


If 

1- both the forward and the reverse kernels are separable and symmetric 

2- The operated image is square M by M 

The transform can be put in matrix notation as follows 

T=AFA , F is an M x M matrix containing the elements of the image f(x,y), 

Where T is the result , A is M by M matrix whose elements ajj—r^fij) 

to obtain the inverse transform, 

Multiplying by B from both sides 
BTB=BAFAB 

If B=A _1 , then the inverse transform is calculated as 

F=BTB , where Matrix F contains the elements(pixels value) of the image f(x,y). 

indicating that image f(x,y) can be recovered completely from its forward transform(T).lf B is not equal to the inverse 
of A, you can get an approximation for f(x,y): 

F=BAFAB 


Probabilistic models 


if intensity values are treated as random quantities 
Possible intensities z t , i - 0,1,2,..., L-1 
) = ^- z k happened 


MN 


L - 1 


Z^) = 1 


k=0 


iSpread of pixel intensities arround their mean 


L-1 

cr 2 = ^ (z* -m) 2 p{z k ) = vaviance 

k = 0 

^Jvariance = cr = standard deviation 
variance and standard deviation are measures 
for image contrast 


mean of an image=m=??? 



a b c 


Image 

SD 

Variance 

Description 

a 

14.3 

204.3 

Low 

contrast 

b 

31.6 

997.8 

Medium 

contrast 

c 

49.2 

2424.9 

Heigh 

contrast 







Probabilistic models 

L - 1 

a 2 = y (z k -m ) 2 p(z k ) = variance = Second moment 

k=0 

In General the nth moment is calculated as 
o" n =S a t -m)"p(z t ) 

k=0 

There are some relation to the moment value and pixel values in an image 
Positive third moment=> pixel values have bias to values higher than the mean 
Negative third moment=>pixel values have bias to values smaller than the mean 
Zero third moment=> pixel values are distributed uniformly the mean 


Level of mathematics needed 

Treating pixel intensity as random variable: random variable theory 
Sequence of image whose pixel intensities are treated as random variable : 
Stochastic process theory 






Lecture scope 


In this lecture, we covered from section 2.6.5 to to 
the end of chapter 2 in the book: Digital image 
processing 3ED, By Gonzalez and Woods 
Workout exercises:) 



